Improving Arabic Text Categorization Using Neural Network with SVD

نویسندگان

  • Fouzi Harrag
  • Eyas Al-Qawasmah
چکیده

In this paper, we present a model based on the Neural Network (NN) for classifying Arabic texts. We propose the use of Singular Value Decomposition (SVD) as a preproces-sor of NN to reduce the data in terms of both size as well as dimensionality so that the input data become more classifiable and faster for the convergence of the training process used in the NN model. To test the effectiveness of the proposed model, experiments were conducted using an in-house collected Arabic corpus for text categorization. The results showed that the proposed model was able to achieve high categorization effectiveness as measured by precision, recall and F-measure. Experimental result shows that the ANN model using SVD is better than the basic ANN on Arabic text classification.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Analysis of the Arabic using neural networks: an overview

This paper is a quick review of some of the scholarly work aiming at solving various problems of the Arabic language using neural networks. It includes some research work concerning online recognition of handwritten Arabic characters, speech recognition, offline character text recognition, text categorization and recognition of printed text. This paper concludes that more research should be con...

متن کامل

Arabic News Articles Classification Using Vectorized-Cosine Based on Seed Documents

Besides for its own merits, text classification (TC) has become a cornerstone in many applications. Work presented here is part of and a pre-requisite for a project we have overtaken to create a corpus for the Arabic text process. It is an attempt to create modules automatically that would help speed up the process of classification for any text categorization task. It also serves as a tool for...

متن کامل

Big Data Categorization for Arabic Text Using Latent Semantic Indexing and Clustering

Documents categorization is an important field in the area of natural language processing. In this paper, we propose using Latent Semantic Indexing (LSI), singular value decomposing (SVD) method, and clustering techniques to group similar unlabeled document into pre-specified number of topics. The generated groups are then categorized using a suitable label. For clustering, we used Expectation–...

متن کامل

Prediction of Deformation of Circular Plates Subjected to Impulsive Loading Using GMDH-type Neural Network

In this paper, experimental responses of the clamped mild steel, copper, and aluminium circular plates are presented subjected to blast loading. The GMDH-type neural networks (Group Method of Data Handling) are then used for the modelling of the mid-point deflection thickness ratio of the circular plates using those experimental results. The aim of such modelling is to show how the mid-point de...

متن کامل

Semantic Addressable Encoding

This paper presents an automatic acquisition process to acquire the semantic meaning for the words. This process obtains the representation vectors for stemmed words by iteratively improving the vectors, using a trained Elman network [4]. Experiments performed on a corpus composed of Shakespeare’s writings show its linguistic analysis and categorization abilities.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • JDIM

دوره 8  شماره 

صفحات  -

تاریخ انتشار 2010